# Vision Encoder-Decoder
Vit Base Patch16 224 Turkish Gpt2
Apache-2.0
A vision encoder-decoder model that pairs a ViT image encoder with a Turkish GPT-2 text decoder to generate Turkish image descriptions.
Image-to-Text
Transformers Other

atasoglu
20
2
Trocr Small Korean
Apache-2.0
A Korean TrOCR image-to-text model built on a vision encoder-decoder architecture, with DeiT as the image encoder and RoBERTa as the text decoder.
Image-to-Text Korean
team-lucid
342
17